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We consider a stochastic model for directed scale-free networks following power-laws in the degree 
distributions in both incoming and outgoing directions. In our model, the number of vertices grow 
geometrically with time with growth rate p. At each time step, (i) each newly introduced vertex is 
connected to a constant number of already existing vertices with the probability linearly proportional 
to the in-degree of a selected vertex, and (ii) each existing vertex updates its outgoing edges through 
a stochastic multiplicative process with mean growth rate of outgoing edges g and variance a 2 . Using 
both analytic treatment and numerical simulations, we show that while the out-degree exponent 7 ou t 
depends on the parameters, the in-degree exponent 7i n has two distinct values, 7i n = 2 for p > g and 
1 for p < g, independent of different parameters values. The latter case has logarithmic correction 
to the power-law. Since the vertex growth rate p is larger than the degree growth rate g for the 
world-wide web (www) nowadays, the in-degree exponent appears robust as 7j n = 2 for the www. 

PACS numbers: PACS numbers: 89.75.Hc, 89.70+c,87.18.Sn,89.75.Da 



I. INTRODUCTION 

Complex system consists of many constituents such 
as individuals, substrates, and companies in social, 
biological, and economic systems, respectively, showing 
cooperative phenomena between constituents through 
diverse interactions and adaptations to the pattern they 
create |jl ^). Recently there have been considerable 
efforts to understand such complex systems in terms 
of random graph, consisting of vertices and edges, 
where vertices (edges) represent constituents (their 
interactions) . This approach was initiated by Erdos and 
Renyi (ER)||. In the ER model, the number of vertices 
is fixed, while edges connecting one vertex to another 
occur randomly with certain probability. The ER model 
is however too random to describe complex systems in 
real world. 

An interesting feature emerging in such complex 
systems is the scale-free (SF) behavior in the degree 
distribution, P(k) ~ /c -7 , where the degree k is the 
number of edges incident upon a given vertex. Barabasi 
and Albert (BA)f§ § introduced an evolving model 
illustrating SF network. In the BA model, the number 
of vertices increases linearly with time, and a newly 
introduced vertex is connected to m already existing 
vertices, following the so-called preferential attach- 
ment (PA) rule that the vertices with more edges are 
preferentially selected for the connection to the new 
vertex with the probability linearly proportional to 
the degree of that vertex. Then it is known that the 
degree distribution follows P(k) ~ k~ 3 for the BA 
model. While the BA model is meaningful as the first 
step to generate SF network, it is too simple to be in 
accordance with the real-world networks. Extended 



versions of the BA model have been introduced || 
taking into account of additional local events such as 
adding new edges, or rewiring edges from one vertex to 
another. Depending on the frequency of these processes, 
the degree distribution either remains as SF with the 
exponent depending on the details of the local event or 
follows an exponential decay. 

Huberman and Adamic (HA)j^] proposed another 
scenario for SF networks. In the HA model, the number 
of vertices grows geometrically with time, and edges of 
each vertex evolve following a stochastic multiplicative 
process. Combining these two ingredients leads to a 
power-law behavior in the degree distribution, where the 
exponent is determined by the growth rates of vertices, 
and the mean degree and variance of the fluctuations 
arising in the stochastic process of updating edges. 
While the HA and BA models look fundamentally 
different at a fist glance, they are similar in essence. 
One can show easily that the multiplicative process is 
reduced to the PA rule when the time dependence of 
the total number of edges is the same as that of the 
number of newly introduced vertices. Moreover, the 
stochastic process in the HA model might be related to 
the rewiring process in the extended model of the BA 
model§. 

SF networks may be classified into undirected or 
directed network whether the directionality is assigned 
to edges or not. Typical examples of undirected networks 
include the actor network the author collaboration 
network |l0|, and the Internet with equal uploading 
and downloading rates|ll . Directed networks are 



also ubiquitous in real world such as the world-wide 
web (www)[|[ [l2| the citation network of scien- 
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tific pap ers |1 4 |, biological networks such as metabolic 
networks [|15| and neural networks, etc. Recently, Albert 
et aL[|l2| and Huberman et at. || investigated the 
topology of the www extensively, and found that the 
in-degree and the out-degree distributions of the www 
exhibit power-law behaviors with different exponents, 
i.e., -PinOin) ~ fc" 7 " 1 and P nt(k ut) ~ k~uC\ respec- 
tively. Here the in-degree k- ln (out-degree fc out ) means 
the number of edges incident upon (emanating from) a 
given vertex. Further studies |l3|, [ll| |l7j showed that 7i„ 
is robust as 7; n « 2.1 for different systems, while 7 ou t 
varies depending on systems in the range, 2.4 ~ 2.7. 

Theoretical studies for directed networks have less 
been carried out compared with those for undirected 
networks. When the directionality is assigned to edge 
in the BA model, pointing from a new vertex to old 
ones, the in-degree and the out-degree distributions 
follow Pin(fcin) ~ fc^ 3 and Pout(fcout) = ^(^out - m) 
respectively, which is not relevant to the empirical 
results for the www. Dorogovtsev and Mendes[l§|] 
performed a similar study using the rate equation, in 
which the in-degree distribution follows a power-law 
whereas the out-degree distribution is of the 5-function. 
More recently, Krapivsky et al. [|l9| studied directed 
SF networks using the rate equation method for the 
simple case similar to the one introduced by Tadic[pO| 
that at each time step, a vertex is newly introduced and 
connected to an old vertex following the PA rule with 
a certain probability and an internal directed edge is 
connected between two vertices chosen following the PA 
rule with the remaining probability. They obtained the 
in-degree and the out-degree distributions analytically, 
both of which exhibit power-law behaviors with different 
exponents depending on the detail of the parameters 
they used. While their analytic treatment was successful 
in generating the empirical values of the out-degree 
and the in-degree exponents for the www by tuning 
the parameters, their model is unable to illustrate the 
robustness of the in-degree exponent for various systems 
because tuning parameters leads to different values of of 
7in and 7 ou t at the same time. 

In this paper, we introduce a stochastic model for 
directed SF networks exhibiting power-law behaviors 
with distinct exponents in both incoming and outgoing 
directions and present an analytic solution for the 
model. Through this study, we can illustrate why 
the in-degree exponent is robust for different systems, 
while the out-degree exponent depends on the details of 
systems. This behavior occurs when the growth rate of 
the number of vertices is large enough compared with 
the effective growth rate of degree of each vertex. 

This paper is organized as follows. In section II, we 
will introduce a stochastic model. In sections III and IV, 



analytic solutions for the out-degree and the in-degree 
distributions will be presented, respectively. In section 
V, we will present the result of numerical simulations for 
the model in the vertex growth dominant and the degree 
growth dominant regimes, respectively. The final section 
will be devoted to the conclusions. 

II. THE MODEL 

Let us introduce a directed SF network model as fol- 
lows: (i) At each time step, the total number of vertices 
increases geometrically with growth rate p, i.e., 

N(t) = N(t-l)(l+p). (1) 

So the total number of vertices newly introduced at time 
t is pN(t—l). (ii) m edges emanate from each new vertex, 
pointing to m distinct old vertices following the PA rule. 
The probability to connect to a vertex j is given by 

= N k ;^ {t ~ l) , (2) 

where k{ n j(t — 1) means the in-degree of the vertex j at 
time t — 1. We assume in the model that each new vertex 
is given an incoming edge pointed from itself, otherwise 
in-degree never grows with time, (iii) each vertex up- 
dates its outgoing edges by either adding new edges or 
deleting existing edges through a multiplicative stochas- 
tic process. Let k outi i(t) denote the out-degree of vertex 
i at time t. Then fc ou t,i(i) evolves as 

*but,i(* + 1) = feout,i(*)(l + (i(t + 1)), (3) 

where Q(t) means the growth rate of the out-degree 
k ut,i(t) at time t, which fluctuates from time to time 
about mean 

Ci(*)= Si + &(*), (4) 

where £,i{t) is assumed to be a white noise satisfying 
(&(«)) = and (&(%(<')) = <>i8t,vSij, where of is 
the variance. The growth rate gi and the standard devi- 
ation Cj could vary in general for different vertices. HA, 
however, assumed that {Q} are uniform for different ver- 
tices, i.e., gi = g and o* = a for all i. When Q(t + 1) > 0, 
the out-degree at vertex i is increased. Then we add 
k ou t,i(t)(i(t + 1) new edges to the vertex i, pointing to 
other distinct vertices which are not connected, according 
to the PA rule given by Eq.(§). When &(*+!) 

< 0, we 

delete k outi i(t)\(i(t + 1)| outgoing edges from the vertex 
i randomly. 

III. THE OUT-DEGREE DISTRIBUTION 

The out-degree distribution P ut(k ut) can be obtained 
by following the argument given by HA. The conditional 
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probability P ut(fcout,T | m) that k out ,i = fc out at time 
t = ti + t for a vertex i born at t = ti with k ont ,i = m is 
given by 



(k out ,T\rnj 



k outy /2ira%T 



exp • 



(In (fcput/m) - g r) 2 
2^F 



}(5) 



The above distribution was obtained by applying the cen- 
tral limit theorem for the variable rn(fc ou t(f)/fc ou t(£ — 1))' 
so that go and <7q in Eq.(||) are related to g and a 2 as 
.9o ~ 9 — (j2 /2, and ctq ~ ° 2 1 respectively ^2lf. Since 
the density of vertices with age r is proportional to 
p(r) ~ exp(— pr), the out-degree distribution collected 
over all ages becomes 

^out(fcout) = J dTp(T)P out (k out ,T\m) ~ fe~ u t°"* 5 (6) 
where 

Tout = 1 H 2 • (') 



We note that the out-degree exponent 7 ou t depends on 
the three parameters, p, g$ and cr - 



IV. THE IN-DEGREE DISTRIBUTION 

The in-degree at a vertex i is increased as new edges 
are additionally pointed from other vertices to i, or de- 
creased as already connected edges are deleted from other 
vertices. For the increased case, there are two types of 
occasions. The first is the case that some of edges from 
newly born vertices are connected to the vertex i. Since 
the total number of edges generated from new vertices at 
time t is given by 



£ ncw (t) = mpN(t - 1), 
the in-degree of the vertex i evolves as 



dt 



(8) 



£nc„W- (9) 



Second is the case that the vertex i receives edges from 
existing vertices as they update their outgoing edges. 
The total number of newly added outgoing edges is given 
by 

N(t-1) 

£add(*)= fc ou tj (t-l)C+W, (10) 

3=1 

where Cj(t) denotes the one when £?'(*) > 0. Then, the 
in-degree of the vertex i evolves as 



dh^jjt) _ k in .j{t - 1) 



c add {t). (ii) 



On the other hand, the decreased case occurs when other 
vertices remove their connections to the vertex i. This 
case occurs when Q(t) < for a vertex j ^ i, with 
denoted by Cj r W- The total number of edges removed 
through this updating process is 

N(t-l) 

£del(i)= J2 fc outj(t-l)K7(*)|. (12) 
3=1 

Although the edges deleted are chosen randomly, the ver- 
tex with more in-degree has more incoming edges deleted 
because incoming edges were formed following the PA 
rule. Thus the deletion process leads to 



dk ln ,i(t) 
dt 



— -1 ^in,r(^ 1 



-C del (t). (13) 



Altogether the dynamic equation for the in-degree of the 
vertex i is written as 



dk in ,j(t) 
dt 



kin,i{t 1) 



y^., — 1 ^ kin,r(t 1) 



^new (^) 



+C add (t)- £ dcl (t)j. (14) 
The above equation can be rewritten as 

-^^ = fci ^ (t - 1) ( L{t) +90 + W))' ( } 

where L(t) denotes the total number of incoming edges 
at time t, 



N{t) 

L(t) = hn,i(t), 

i 

which behaves asymptotically as 

(A ie P\ ifp><? , 
L(t)*i A 2 teP\ i£p = g , 
[A 3 e9o\ ifjj<<? , 

where A±, A 2 , and A3 are given as 

A _ mpN(0) 



(16) 



(17) 



(P - 9o) 
A 2 = mpN(0), 



and 



A, = 



mpN(0) 
(.9o - p) ' 



(18) 
(19) 

(20) 



x(t) in Eq.(|rj) is defined as 

N(t-1) 



X 



(t)= J2 kaut,i(t-l)(tt-\&\), (21) 
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where £ t + (t) (£ t r (t)) denotes the noise for £i(t) > 
(£i(t) < 0). Then using the stochastic property, (&) = 0, 
we obtain that 



(*(*)) = o, 



(22) 



and 

(x(*)x(f)> 



B^S^,, iip> 2(. 9o + a 2 /2), 

B 2 teP t 5 t , v , ifp = 2(. 9o + <x§/2), 

S 3e 2 ^+-o/2)t5 t i/) if p < 2(50 + ^/2), 

(23) 

where Si, S2, andi?3 are given by 

m 2 aQpNo 



and 



Si 
S 2 



2(p - a 2 - 2g ) ' 
m 2 <x 2 piV (l+;p) 2 , 

m 2 alpN a 
2{a 2 + 2g Q -pY 



(24) 
(25) 

(26) 



Thus x{t) pl a ys a role of noise, and its variance depends 
on time. 

The asymptotic behavior of the dynamic equation 
Eq.(|l5|) depends on relative magnitudes among p, go, 
and cr 2 . We consider every possible case below. 

(I) When p > go + (T 2 /2{i.e., p > g), the stochastic 
term, the last term in Eq. (|l5|) , is negligible in long time 
limit. Moreover, since N(t) and L(t) have the same time- 
dependence, Eq.([l5|) is simply reduced to 



at 



■ pk iaii (t). 



(27) 



Thus the in-degrec of a vertex i born at time t = ti 
becomes 



- Mt-h) 



(28) 



Then the in-degree distribution becomes 
8 



Pin (&ii 



d 



(1 -P(k- m > hn,i)) fc ilM =fc ln (29) 



-) \k iDti =k ia 



(30) 



with 7j e 



(II) When go < p < go + 4/2 (i.e., g - a 2 /2 <p<g), 
the dynamic equation is reduced to asymptotically 



dt 



i(t)(p- 



L(t) 



(31) 



Since (%(£)) = 0, one may regard the above equation 
as a stochastic log-normal dynamic equation with the 
variance, 



(x(t)x(t')) 

L(t) 2 



V 2 e 2st 5 t , t ,, 



(32) 



with V\ = B 3 /A 2 and s = g + Oq/2 — p. Since 
s > 0, the fluctuation term cannot be ignored. Invok- 
ing the central limit theorem, the conditional probability 
Si n (fci n , r|fci nj o), that fci n .i = k- m at time t = ti + t, given 
k-m.i = h ni o = 1 at t = ti becomes, 



Pin ( kin, T~\ki n ,0 ) — 
1 



: exp • 



(In (kin/kinfl) - pr) 



k in ^2nV 2 (e^ -l)/2s I 2V 2 (e^ _ l)J 
So the in-degree distribution can be obtained through 

Sin(fcin)= / p(r)P in (fc in ,r|fc in ,o)dr. (34) 
Jo 

When 2st ^> 1, it can be shown using the saddle point 
approximation that the in-degree distribution is of the 
form, 



Pm{kb 



1 



k in (lnk in )(p/^y 



(35) 



which is valid as long as lnfc; n <§; e . When p = go, Df 
is replaced by B3/A 2 . 

(Ill) For p < go (i.e., p < g — cr 2 /2), the dynamic 
equation of k lri ^(t) can be written as 



dh n: i(t) 



kin,i(t)(go 



x(t) 



dt ' m ' H 'l 3U ' L(t) 
The variance can be written as 

= Vie°o t 8 t , t ,, 



(X(t)x(t')) _ V 2 r ^> 



(36) 



(37) 



with V?, = B3/A 2 . Following the same step as used in 
the second case, we obtain that 



Sin (fci n ) 



1 



fc in (lnfc in )( 2 9°/ ,T o+ 1 ) 



(38) 



Conclusively, when the growth rate of vertex p is larger 
than the effective growth rate of edge, go + ctq/2, the in- 
degree distribution is independent of the detail of evolv- 
ing networks, so that the in-degree exponent is robust for 
different systems, while the out-degree exponent depends 
on the detail. This is the case we observe in the real www 
because the number of wcbpagcs increases rapidly nowa- 
days, whereas average number of hyperlinks does rather 
at a slower rate due to limited space on webpage. When 
the number of webpages is saturated in the future, the 
growth rate p will become moderated with the number of 
hyperlinks go much dominant. Then the in-degree distri- 



bution exhibits a phase transition to the form Eq.(35) or 
(|38|), implying that the hyperlink is much centralized to 
a few famous webpages. The phase diagram is depicted 
in Fig.l. 
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V. NUMERICAL SIMULATIONS 
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It was reported [0 that the Web consisted of N = 
203 x 10 6 documents from the point of view of Al- 
tavista, and the average in-degree and out-degree are 
k in = fc out = 7.22 as of May of 1999, and N = 271 x 10 6 
and k- m = fc out = 7.85 as of October of 1999. Based on 
this data, we estimate very roughly the vertex and mean 
degree growth rates to be p 0.059 and g « 0.017 per 
mon th, rcapcctivcly. However, the fluctuation atrcngth a 
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is no t known. Uaing the catimatcd valuca of p and g, we 
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to be in the regions (I) and (II). The simulation results 
are compared with the theoretical predictions. First, we 
choose the variances Cq = 0.052 and 0.021 belonging to 
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VI. CONCLUSIONS 

We have introduced a stochastic model for directed SF 
networks, which evolves with time. In our model, the evo- 
lution of outgoing edges follows the stochastic multiplica- 
tive process, while that of incoming edges does the pref- 



erential attachment. Willi tffls model, we Could illustrate 
why the in-degree' exponent for ihe www ts robusi, inde- 
pendent of different systems, while the out-degree expo- 
nent depends on different systems. We presented analytic 
results for both the in-degree and the out-degree distri- 
bution and confirmed our theoretical predictions by per- 
forming numerical simulations with the parameter values 
estimated from the www in real world. 



[9 

[io; 
in 

[12; 

[13 



[14 
[15 

[16. 



[17 



M 



TT9 

[20 
[21 



S.H. Strogatz, Nature 410, 268 (2001). 
N. Goldenfeld and L.P. Kadanoff, Science 284, 87 (1999) 
P. Erdos, A. Renyl, Publ. Math. Inst. Hung. Acad. Sci 
5, 17 (1960). 

A.-L. Barabasi, R. Albert and H. Jeong, Physica A, 272, 
173 (1999); R. Albert and A.-L. Barabasi, flcond 



mat/0106144) 



A. -L. Barabasi and R. Albert, Science 286, 509 (1999). 
R. Albert and A.-L. Barabasi, Phys. Rev. Lett. 85, 5234 

(2000) . 

G. Bianconi and A.-L. Barabasi, Europhys. Lett. 54, 436 

(2001) . 

B. A. Huberman and L.A. Adamic, Nature 401 . 131 
(1999); B. A . Huberman and L. A. Adamic, (cond 



mat/9901071) 



M.E.J. Newman, S.H. Strogatz, and D.J. Watts, 
Phys.Rev.E 64, 026118 (2001). 

M.E.J. Newman, Proc. Natl. Acad. Sci. (USA) 98, 404 
(2001); L. A. Amaral, A. Scala, M. Barthelemy, and H.E. 
Stanley, Proc. Natl Acad. Sci. USA 97, 11149 (2000). 
E.W. Zegura, K.L. Calvert, and M.J. Donahoo, IEEE/ 
ACM Trans. Network 5, 770 (1997); M. Faloutsos, P. 
Faloutsos, and C. Faloutsos, Comp. Comm. Rev. 29, 251 

(1999) . 

R. Albert, H. Jeong, and A.-L. Barabasi, Nature, 401, 
130 (1999). 

R.P. Kumar, S. Raghavan, D. Rajalopagan, and A. S. 

Tomkins, Proceedings of the 9th ACM Symposium on 

Principles of Database Systems, 1. 

S. Redner, Eur. Phys. J. B 4, 131 (1998). 

H. Jeong, B. Tombor, R. Albert, Z.N. Oltvani, and A.-L. 

Barabasi, Nature 407, 651 (2000). 

J.M. Kleinberg, R. Kumar, P. Raghavan, S. Rajagopalan, 
and A. Tomkins, Proceedings of the International Con- 
ference on Combinatorics and Computing. (1999). 
A. Broder, R. Kumar, F. Maghoul, P. Raghavan, S. Ra- 
jagopalan, and R. Stata, A, Tomkins and J. Wiener, Pro- 
ceedings of the 9th WWW conference, Amsterdam, 309 

(2000) . 

S.N. Dorog ovtsev and J.F.F. Mendes (cond 



mat/010614 



PL. Krapivsky, G.J. Rodgers, and S. Redner, Phys. Rev 
Lett. 86, 5401 (2001). 
B. Tadic, Physica A 293, 273 (2001) 



K.-I. Goh, B. Kahng, and D. Kim, (cond-mat/0108031) 



ACKNOWLEDGEMENTS 



We would like to thank A.-L. Barabasi for helpful 
discussions. This work was supported by the Korean 



G 



(I) 



P=9o+ f 



/ P=9 



/ (II) / 



(HI) 



9o 



FIG. 1: Phase diagram for different behaviors of the in- 
degree distribution. In the region (I), the in-degree distribu- 
tion shows P(fcin) ~ fcjn 2 , while P(fen) ~ l/fcin(ln km) 13 , with 
(3 = l+p/(g + a%/2-p) in the region (II), and j3 = l + go/ag 
in the region (III). 
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FIG. 2: Plot of the in-degree and out-degree distributions 
drawn in the cumulated way, P CU m(fc) = J, P(k)dk for the 
cases, p — 0.059, g = 0.017, and a 2 — 0.051 for (a), and 
a 2 = 0.021 for (b). Both cases belong to the region (I). 
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FIG. 3: Plot of the in-degree and out-degree distributions 
drawn in the cumulated way, P cum (fc) = f°° P(k)dk for the 
cases, p = 0.010, g = 0.017, and a 2 = 0.051, belonging to the 
region (II). The numerical data for the out-degree distribution 
shows the power-law behavior with the exponent 7 out w 1.8, 
and those for the in-degree does with 7i n ~ 1. 



